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CN | The walk distances in graphs are defined as the result of appropriate transformations 

of the Yl'kLo(tA) k proximity measures, where A is the weighted adjacency matrix of a 
graph and t is a sufficiently small positive parameter. The walk distances are graph- 
geodetic; moreover, they converge to the shortest path distance and to the so-called 
. long walk distance as the parameter t approaches its limiting values. We also show that 

the logarithmic forest distances which are known to generalize the resistance distance 
and the shortest path distance are a specific subclass of walk distances. On the other 
hand, the long walk distance is equal to the resistance distance in a transformed graph. 

Keywords: Graph distances; Walk distances; Logarithmic forest distances; Transitional 
measure; Resistance distance; Network 

MSC: 05C12, 05C50, 15B48 



o 

u 

■i— > 



1 Introduction 



X 



The classical distances for graph vertices are the shortest path distance [3], the resistance 
CNj \ distance [22,29,39-41], which is proportional to the commute time distance [20], and the 
\ square root version of the resistance distance [32,47,48]. Recently, a need for a wider variety 
of graph distances has been strongly felt (see, e.g., [15,18,49,52,53] among many others). 

Recall the well-known fact that the shortest path distance and the resistance distance 
coincide on each tree. In particular, for every path, the resistance distance between every two 
adjacent vertices is one, as well as the shortest path distance. However, in some applications 
two central adjacent vertices in a path may be considered as being closer to each other than 
two peripheral adjacent vertices are as there are more walks (of length 3, 5, etc.) connecting 
two central vertices. Such a "gravitational" property holds for the forest distances [11]. In 
some other applications, a terminal vertex in a path can be considered as being closer to its 
neighbor than two central adjacent vertices are. For example, if someone has a single friend, 
then this friendship is often stronger than that between persons having more friends. This 
heuristic is supported by the logarithmic forest distances [4]. 

In [5], a general framework was proposed for constructing graph-geodetic distances 1 (a dis- 
tance d(i,j) for graph vertices is graph-geodetic whenever d(i,j) + d(j,k) = d(i,k) if and 
only if every path connecting i and k visits j). Namely, it has been shown that if a matrix 
S = (sij) produces a strictly positive transitional measure on a graph G (i.e., Sy Sjk < Sik % 
for all vertices i, j, and k, while Sjj Sjk = Sik Sjj if and only if every path from i to k vis- 
its j), then the logarithmic transformation hij = lnsjj and the inverse covariance mapping 



1 ln this paper, a distance is assumed to satisfy the axioms of metric. 
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dij = ha + hjj — hij — hji convert S into the matrix of a graph-geodetic distance. In the case 
of digraphs, five transitional measures were found in [5] , namely, the "connection reliability" , 
the "path accessibility" with a sufficiently small parameter, the "walk accessibility" , and two 
versions of the "forest accessibility". The distances produced by the forest accessibility on 
weighted multigraphs (networks) were studied in [4]. 

In [10] we applied the inverse covariance mapping to the matrices of walk weights 
Sfclo(^) fc > wnere A is the adjacency matrix of a graph, and showed that this leads to dis- 
tances whenever the positive parameter t is sufficiently small. However, these distances are 
not graph-geodetic and some of their properties are exotic (see Section 10). 

In the present paper, we study the class of graph-geodetic walk distances, which involves 
the logarithmic transformation. Sections 2 and 3 contain definitions and preliminaries, in 
Section 4 the walk distances are expressed in terms of commute cycles and via block matrix 
operations. Sections 5 and 6 are devoted to two limiting cases of walk distances: the short 
walk distance coincides with the classical shortest path distance, while the long walk distance 
is original. In Section 7, we consider modified walk distances (the "e-walk distances" ) which 
generalize the classical weighted shortest path distance. In Section 8, it is shown that adding 
"balancing loops" converts the logarithmic forest distances into a subclass of walk distances. 
This implies, in particular, that the resistance distance is also a limiting walk distance, as 
shown in Section 9. In Section 10, several graph metrics are compared on a simple example. 



In the graph definitions we mainly follow [23]. Let G be a weighted multigraph (a weighted 
graph where multiple edges are allowed) with vertex set V(G) = V, \V\ = n > 1, and edge 
set E{G). Loops are allowed; throughout the paper we assume that G is connected. For 
brevity, we call G a graph. For i,j G V(G), let G {0,1,...} be the number of edges 
incident to both i and j in G; for every q G {1, . . . , n^}, > is the weight of the qth edge 
of this type. Let 



(if riij = 0, we set a^- = 0) and A = A(G) = (aij) nxn ; A is the symmetric weighted adjacency 
matrix of G. In this paper, all matrix entries are indexed by the vertices of G. This remark is 
essential when submatrices are considered: say, "the ith column" of a submatrix of A means 
"the column corresponding to the vertex % of G" rather than just the "column number i" , 
which may differ. 

By the weight of a graph G, w{G), we mean the product of the weights of all its edges. 
If G has no edges, then w(G) = 1. The weight of a set S of graphs, w(S), is the total weight 
(the sum of the weights) of its elements; w(0) = 0. If the weights of all edges are unity, 
i.e. the graphs in S are actually unweighted, then w(S) reduces to the cardinality of S. The 
weights of sequences of vertices and edges and of their sets are defined similarly. 

For vo,v m G V(G), av ->f m path (simple path) in G is an alternating sequence of vertices 
and edges vq, e±, v i, . . . , e m , v m where all vertices are distinct and each is a (i>j-i, i>j) edge. 



2 Notation 
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The unique v o — > f o path is the "sequence" t>o having no edges. The length of a path is the 
number m of its edges. The weight of a path is the product of the weights of its edges. The 
weight of a v q — > v path is 1. 

Similarly, a v o — >■ t> m walk (sometimes also called a route, cf. [5]) in G is an arbitrary 
alternating sequence of vertices and edges v ,ei,Vi, . . . ,e m ,v m where each e^ is a (i>j_i,i>j) 
edge. The length of a walk is the number m of its edges (including loops and repeated edges). 
The weight of a walk is the product of the m weights of its edges. The weight of a set of 
walks is the total weight of its elements. By definition, for any vertex t> , there is one Vq — > Vq 
walk vo with length and weight 1. 

We will need several special types of walks. A hitting v — > v m walk is a v o — > v m walk 
containing only one occurrence of v m . A v — > v m walk is called a vo — > vq cycle if 2 v m =t)o- 
A v o — > v o cycle is called a v q v m commute cycle if it contains v m and has no occurrences 
of Vq strictly between the first appearance of v m and the final appearance of Vq. 

Let r\j be the weight of the set TZ l i of all i — > j walks in G, provided that this weight is 
finite. R = R(G) = (rij) nxn will be referred to as the matrix of walk weights. 

By d s (i,j) we denote the shortest path distance, i.e., the length of a shortest path between 
i and j in G. The weighted shortest path distance d ws (i,j) is defined as follows: 3 



where the minimum is taken over all paths it from i to j and the sum is over all edges e in 7r; 
l e = l/w e is sometimes called the weighted length of the edge e, where w e is the weight of this 
edge (see, e.g., [13]). In the theory of electrical networks, l e is identified with the resistance 
of the edge e, while w e is its conductivity. 

Definition 1 ([5]). Given a graph G, we say that a matrix S = (s i j)eW axn determines the 
transitional measure s(i, j)—Sij, i,j<EV, for G if S satisfies the transition inequality 4 " 

&ij Sjk — $ik ifji k V 

and the graph bottleneck identity with respect to G : 



holds if and only if all paths in G from % to k contain j. 

The transition inequality is a multiplicative analogue of the triangle inequality for prox- 
imities [9,10] also called the "unrooted correlation triangle inequality" [16]. 

Definition 2 ([5]). For a multigraph G with vertex set V, a function d: VxV — > M. is called 
graph- geodetic provided that d(i,j) + d(j,k) = d{i,k) holds if and only if every path in G 
connecting i and k contains j. 

In the following section, we define the class of walk distances and present a number of 
preliminary results needed in the subsequent study. 

2 Such a walk is also called a closed walk. We use the term cycle for simplicity; this usage is common in 
computer science. 

3 This formula corrects Eq. (6.2) in [29]; cf. [27, Section 4]. 

4 If S has positive diagonal entries, then the transition inequality is equivalent to s'^ s'- k < s' ik , where 




TT 



(2) 



ee£(7r) 





'33 



i,j,ke V. 
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3 The walk distances 



Recall that is the weight of the set TZ^ of all i — > j walks in G provided that this weight 
is finite, R = (rij) nxn being the matrix of walk weights. 

For any t > 0, consider the graph G(t) obtained from G by multiplying all edge weights 
by t. If the matrix R t = R(G(t)) = (rij(t)) nxn exists, then 5 

oo 

R t = Y,(tA) k = (I-tA)-\ (3) 

fc=0 

where / denotes the identity matrix of appropriate dimension. 

By assumption, G is connected, while its edge weights are positive, so Rt is positive 
whenever it exists. Assuming the finiteness of Rt, apply the logarithmic transformation to 
the entries of R t , namely, consider the matrix 

H t = bit (4) 

where p(s] stands for elementwise operations, i.e., operations applied to each entry of S 
separately. Finally, consider the matrix 

D t = ^(h t l T + lhj) - H t , (5) 

where h t is the column vector of the diagonal entries of H t (the trace vector of H t ), 1 is the 
vector of ones of appropriate dimension, and ftf and 1 T are the transposes of h t and 1. An 
alternative form of (5) is D t = (U t + U^)/2, where U t = h t l T — H t , and the elementwise form 
is dij(t) = \{hu(t) +hjj(t)) - h i:j (t), i,j e V(G), where H t = (h^t)) and D t = (d^t)). This 
is a standard transformation used to obtain a distance from a proximity measure (cf. the 
inverse covariance mapping in [16, Section 5.2] and the cosine law in [14]). 

In the rest of this section, we present several known facts (lemmas) which will be of use 
in the study of the walk distances. The first lemma follows from Theorem 6 in [5]. 

Lemma 1. For any connected graph G, if the matrix R t = (^-(i)) of walk weights in G(t) 
exists, then R t determines a strictly positive transitional measure for G. 

According to Theorem 1 in [5], if S— (sij) nxn determines a transitional measure for G and 
has positive off-diagonal entries, then D = (dij) nxn defined by D = \{hl T + lh T — H — IT 2 ), 

where H = In is a matrix of distances on V(G). Moreover, by Theorem 2 in [5] this 
distance is graph-geodetic. Along with Lemma 1 this implies the following lemma, which 
appears in [5] as item 2 of Corollary 2. 

Lemma 2. For any connected G, if Rt = ij'ijit)) exists, then the matrix D t = (dij(t)) defined 
by (3) -(5) determines a graph- geodetic distance d t (i,j) = dij(t) on V(G). 

5 For an early study of the graph proximity measure Y^k=o^^) k ' we re f er the reader to [25,26,34,50,51]. 
More recently, it has been explored in [10,12,17,53]. On counting walks, see also [24] and on its applications 
in chemistry, [28]. 
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Definition 3. For a connected graph G, the walk distances on V(G) are the functions d t (i,j) : 
V(Cr) xV(G) K. and the functions positively proportional to them, where d t (i,j) = dij(t) 
and _D t = (dij(t)) is defined by (3)-(5). 

Regarding the finiteness of R t , since for a connected graph, A is irreducible, the Perron- 
Frobenius theory of nonnegative matrices provides the following result (cf. [51, Theorem 4]). 

Lemma 3. For any weighted adjacency matrix A of a connected graph G, the series Rt = 
^2*kLo(tA) k with t > converges to (I — tA)~ l if and only if t < p~ 1 , where p = p(A) is the 
spectral radius of A. Moreover, p is an eigenvalue of A; as such p has multiplicity 1 and a 
positive eigenvector. 

Eigenvalue p = p(A) is called the Perron root of A. If x is an eigenvector of A associated 
with p, then the probability vector p — x/\\x\\i is called the Perron vector of A. 

Lemma 4. For any vertices i,j G V(G) and < t < p^ 1 , 

d t (i,j) = -\J / Vij ^ ) 
W^r^t)) 

Lemma 4 is a corollary of (4) and (5) (cf. Eq. (11) in [5]). The author is grateful to 
Michel Deza for mentioning the genetic distance by Nei [33], which has a form similar to (6). 

Lemma 5 appeared in [5] as Eq. (23). Despite its simplicity, it plays an important role in 
the subsequent study. 

Lemma 5. // the matrix R = (r^) exists, then for any vertices i,j G V(G), 

r % 3=r %3 (i)r j3 , (7) 
where r^^ = w('R?^) is the weight of the set of all i — > j hitting walks in G. 



(6) 



4 Two expressions for the walk distances 

The first result enables one to interpret the walk distances in terms of specific walks in G. 
Technically, it is a consequence of the previous lemmas. 

Theorem 1. For any t G]0,p _1 [, the matrix of walk distances D t has the representation 

D t = -\HR il)t Rl^ = -\\^, (8) 

where R(\)t = (fij(i)(t))nxn is the matrix of hitting walk weights in G(t), = (cfj(t)) nxn , 
and cfj(t) = w{Ct~^) is the weight of the set C\*~^ °f a ^ i j commute cycles in G(t). 

Proof. By Lemma 3, if < t < p^ 1 , then the distance matrix D t exists and by Lemmas 4 
and 5, for any vertices % and j we have 

1 r 2 (t) 1 
dt(i,j) = -2 ln ril (t) rjj {t) = ~2 ln ^ ij{1) ^ rji{1) ^ ' (9) 

5 



Observing now that there is a natural bijection between TZt^ x Tit ^ an d Ct~ j we obtain 

<%(t) = r m {t)r m (t), i,jeV(G), (10) 
which implies (8). □ 

By virtue of Theorem 1, there is a certain analogy between the walk distances and the 
classical commute time distance. One of the consequences of Theorem 1 is that w(Cl^ J ) < 1 
whenever R t exists and % ^ j. A "topological" interpretation of the walk distances is presented 
in [8]. 

The following result provides an expression for the walk distances which will be of use in 
the sequel. 

Theorem 2. For any connected G, any vertices i,j e V(G), and any t G]0,p _1 [, 

d t (i,j) = -\ Inftr 1 / - A- 3 ^a- 03 (r 1 / - A^J 1 ^), 

where Mjj is the submatrix of M obtained by the removal of row j and column j, M~ l is the 
ith row of M _1 , and is the jth column of A with a ]3 - removed. 

Proof. Theorem 2 is immediate from (9) and the following lemma. □ 
Lemma 6. In the notation of Theorems 1 and 2, 

r im (t) = (t- 1 I-A 3} )r 1 a Jj (11) 

whenever t e ]0, {p{Ajj))~ l [. 

Proof. Observe that any i — > j hitting walk in G(t) can be uniquely decomposed into: 
(1) some % — > k walk in the subgraph Gj(t) of G(t) obtained by the removal of vertex j and 
all edges incident to it and (2) a (k, j) edge. If < t < (p(Ajj))" 1 , then the total weights of 
the % — >■ k walks in Gj(t) form the ith row of (7 — tAjj)^ 1 , whereas the total weights of the 
(k, j) edges (with k ^ j) form the vector ta^. The desired expression follows. □ 

When considering graph distances, of major interest are the proportions of distances 
for different pairs of vertices rather than the distances themselves. On the other hand, for 
studying the limit properties, it is convenient to consider, among the positive multiples of 
d t (i,j) (see Definition 3), the specific walk distances d™(i,j) with: "W" referring to "walk", 

dZ(i,j)=6dt(i,j), (12) 

a being the parameter connected with both t and p by 

a = (r 1 -p)-\ (13) 

and 9 being the scaling factor given by 

6 = \n(e + ai)%—^. (14) 
In a 

The factor 9 as a function of a and n is assumed to extend to a = 1 by continuity: 

9 = ln(e + 1) whenever a — 1. 

These parameterization and scaling will prove convenient in the following sections. In par- 
ticular, it is worth mentioning that they ensure comparability of the walk distances with the 
logarithmic forest distances [4] (cf. Section 8). 
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5 The short walk distance 

Consider the behavior of the walk distances d™(i,j) as a — > + (t — >• + ). The corresponding 
limit of d™(i,j) (if it exists and provides a distance) can be termed the short walk distance 
because t — > + leads to neglecting long walks in (3). It turns out that the short walk 
distance coincides with the classical shortest path distance d s (i,j). 

Theorem 3. For any vertices i,j G V, 

lim dZ(i,j)=d\i,j), 

a->-0+ 

where d s (i,j) is the shortest path distance between i and j in G. 

Proof. For any vertices % and j^i, let m = d s (i,j). Let r^ be the ij-entry of A m . Using 
Lemma 4 and (12)— (14) yields 

hm d™(t,j) = km -m(e + a™)- In — ^-j — JJ \\ — - 

a ^o+ aK,JJ a ^o+2 v ' In a r ij{{p + a ) ) 

= — km (ma) In — „ , , 
2^o+ V 4( a ) 

1 ,. n Hl (l + o(l))(l + o(l)) 
= — hm ma In 

2a^o+ v {a m rlf + o(a m )) 2 

= lim (lna) _1 (mlna + lnr,-" 1 - 1 ) = m, 

wkere o(/(a)) are suck terms tkat — > as a — > + . Tkis completes tke proof. □ 



6 The long walk distance 

Consider tke asymptotic bekavior of tke walk distances as a — > oo (t — > (p^ 1 )"). First, tke 
bebavior of Rt = X]fcLo(^) fc is clear from tke following lemma. 

Lemma 7. For any connected graph G, 

lim (t _1 — p)R t = lim a~ x R t = ppp T , (15) 

t_).(p-l)- a-s>oo 

where p = p/\\p\\2, P = (pi, ■ ■ ■ ,Pn) T is the Perron vector of A, and (13) is used. 

Proof. Eq. (15) can be easily derived from, say, Tkeorem 3.1 in [31] and tke fact tkat pp T is 
tke eigenprojection of A corresponding to p (see also [36,37]). Wken applying tkis tkeorem, 
to verify tkat tke limit in (15) exists, one skould observe tkat tke index of A at p is 1 since 
A is diagonalizable as a Hermitian matrix. □ 

Wkile tke entries of Rt (tke total weigkts of walks between vertices) tend to infinity as 
t — > (p^ 1 )", tke weigkts of kitting walks and commute cycles remain finite. 
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Corollary 1 (of Lemma 7). In the notation of Theorem 1, for any vertices i,j G V, 

lim r ij{1) (t) = ^, (16) 

lim cfAt) = 1. (17) 
t-Kp- 1 )- 3 

Proof. Combining (7) and (15) yields 

lim r m) (t)= lim ^ = = -■ 

In view of (10), Eq. (17) also holds. □ 

It follows from (8) and (17) that the distances d^it) vanish as t — > (p _1 )~ in spite of 
the infiniteness of R t . Furthermore, since A is irreducible, p{Ajj) < p(A) for any j e V [19, 
Ch. Ill, § 3.4]. Consequently, by (11) and (10), pi— Ajjis non-singular, rjj(i)(p _1 ) and c^(p _1 ) 
make sense, and so Eqs. (16) and (17) should be supplemented by 

nmip- 1 ) = |, (18) 

c%{p~ l ) = 1. (19) 

Substituting (19) in (8) results in D p -i = 0. However, by (13), a is indefinite at t — p' 1 , so 
da(i,j) defined by (12) is indefinite as well. Therefore lim t _^ p -i)- d™(i,j) = lim^oo d™(i,j) 
is worth evaluating. Let us study this limit. 

We define the long walk distance d LW (i,j) as follows: 

d™(t,j)= hmC(U), i,jeV, (20) 



a— >oo 



provided that the limit exists and induces a distance function. In Theorem 4, we obtain a 
closed formula for d hW (i, j); after that we give it an interpretation and discuss two examples. 

Theorem 4. For any vertices i,j<EV such that i ^ j, 

d LW ^j) = n-^p-^-'pj + pj 1 ^)-^,), (21) 

where C = pi — A, p = (pi, . . . ,p n ) T is the Perron vector of A, pj is p with pj removed, and 
the other notation is the same as in Theorem 2. 

Proof. Using Theorem 2, (12)— (14) and the Taylor expansion we obtain 
d™(i,j) = lim j) 



| 2 (X — 1 

- lim - ln(e + «»)- lnftr 1 / - A^a^ (r 1 / - A^j 1 ^) 

a^oo 2 mo; JJ j ' 

-rT l lim aln^Cjj)' 1 - a" 1 ^)' 2 + 0(0*-% a sj 

X [(^)- 1 -o ; - 1 (^)- 2 + (Q ; - 1 )] j% ) 

-vT 1 lim alnlpijpji - a~ 1 [Y(j) i Y{j)a lj p ji + p^Y^Y^a^ + o(a -1 )), 



where 

Y(i) = {C^)-\ ieV (22) 

and 

Pij = Y(j) iajj , i.jtW i / ./. (23) 
To proceed, we need the following lemma. 

Lemma 8. 

p tJ = -, i,jeV(G), i^j, (24) 
Vj 

where pij is defined by (23) and p = (pi, ■ ■ ■ ,p n ) T is the Perron vector of A. 

Lemma 8 can be proved using (11) and (18), however, it is more instructive to give a 
direct proof. 

Proof. As p is the Perron vector of A, it obeys Cp = 0. Removing the j'th equation from this 
linear system and rearranging all the p^-'s to the right side yields CjjPj = Pj a jj- Therefore, 
since Cjj is non-singular, pj x Pj = Y(j)a^ holds, as required. □ 

Using Lemma 8 we now complete the proof of Theorem 4: 



d LW (i,j) = -n 1 lim ln(l - a \ Pi 1 p j Y(j) iPj 1 p J + P j 1 p i Y(i) j p i 1 p I ]) a 

= -n~ l In exp (-pr 1 Y(j) 4 Pj - p" 1 Y"(i)j Pl ) 

= n- 1 {p-\Cj,)-% + pj 1 (A,)" V) , 
as desired. □ 

The symmetric irreducible singular M-matrix C = pi — A plays a central role in this 
paper. It can be termed the para-Laplacian matrix of G. C has rank n — 1 and is positive 
semidefinite. 

Expression (21) can be written in a more elegant form. 
Corollary 2 (of Theorem 4). For any vertices i,j&V such that i + j, 

d™(ij) = n- 1 [(pi - Bjj)' 1 + (pi - B^j 1 ] 1, (25) 

where B = P~ X AP and P = diagp. 

The proof of Corollary 2 is straightforward. It should be noted that Q = (pP)~ 1 AP is a 
stochastic matrix which can be naturally attached to G. In terms of Q, one can write: 

d™(t, j) = (up)' 1 [(I - Qjj)- 1 + (/ - Q^J 1 ] 1, j ? i. 

Now let us give an interpretation of d hW (i,j) in terms of walks. 
Denote by C 1 ^ the set of all cycles (i.e., closed walks) in G(p~ l ) that 
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• start and finish at i; 

• consist of two consecutive walks such that the first one does not contain j and finishes 
at some vertex k which is marked; the second one does not contain i, except for its end 
vertex. 

Let = w(C^) be the weight of the set C i{j \ 

Corollary 3 (of Theorem 4). For any vertices i,jeV such that i ^ j, 

d LW (i,j) = (n P )-\c lij) + c> {i) ). 

Proof. Let yf^ be the fcth element of Y(j)i = (Cjj)' 1 , where k G V\ {j}. It follows from 
the proof of Lemma 6 (Section 4) that pyf^ is equal to the total weight of the % — >■ k walks 
in Gj(p^ 1 ). Using Theorem 4 and (18) we obtain 

k&V, k^j fce V, k^i 

k£V, k^j keV, k^i 

= (npy^c'W + tf®), (26) 
as required. □ 

By virtue of Corollary 3, it can be said that d hW (i,j) is proportional to the sum of the 
weights of certain walks starting at i, avoiding j, and then returning to i and certain walks 
starting at j, avoiding i, and then returning to j. 

Corollary 4 (of Theorem 4). For any connected graph G, the function d LW (i,j) is a metric. 

Proof. Since d LW (i,j) is a finite limit of distances, it suffices to prove that d LW (i,j) ^ when- 
ever j 7^ i. This follows from the non-emptiness of C 1 ^ for all i and j ^ i and Corollary 3. □ 

Example 1. For the unweighted path P4 (Fig. 1), we find using Theorem 4 that 
d LW (l,2)/d LW (2,3) = (1 + VE)/2, the golden ratio. 

1- -2- -3- -4 1- -2- -3- -4- -5 

Pa P5 
Figure 1: The paths P 4 and P 5 . 

In general, it can be shown that the long walk distance between central adjacent vertices 
in a path is smaller than that between peripheral adjacent vertices. For example, for P 5 
(Fig. 1), rf LW (l,2)/rf LW (2,3) = 2. This distinguishes the long walk distance (and all walk 
distances) from the logarithmic forest distances (cf. the remark "On the 'mixture' of the 
shortest-path and resistance distances" in Section 6 of [4]). Since the long walk distance is 
the limit of graph-geodetic distances, the distances between non- adjacent vertices % and j 
in a path are equal to the sum of distances between the subsequent vertices in the subpath 
connecting % and j. 
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Example 2. Consider the weighted paths with four and five vertices and the edge weights 
shown in Fig. 2. 

Weights: V2 1 V2 Weights: V2 1 1 V2 

1- -2- -3- -4 1- -2- -3- -4- -5 

Pa P 5 
Figure 2: Two weighted paths, P 4 and P 5 . 

The results are as follows: for P 4 , d LW (l,2) = d LW (2, 3) = 0.75; for P 5 , d LW (l,2) = 
d LW (2, 3) = 0.8. The same pattern is preserved for all weighted paths of this kind. Say, for 
Pio with the two terminal weights \/2 and the other weights 1, all the long walk distances 
between adjacent vertices are 0.9 (and — - for P n , n > 2). Thus, the weights of \/2 com- 
pletely compensate the "extremality" of path's terminal vertices with respect to the long 
walk distance. 

7 The e-walk distances which generalize 
the weighted shortest path distance 

In Section 3, the graph G(t) was constructed by multiplying all edge weights in G by t. Now 
consider a more sophisticated transformation: 6 

w{a) = ^e~^, a > 0, (27) 

where w is any edge weight in G, w(a) is the weight of the corresponding edge in the 
transformed graph G(a), and p is the Perron root of the weighted adjacency matrix A of G. 
The total edge weights dij(a) = Y^q=i w ij(. a ) f° r an pairs of vertices form the weighted 
adjacency matrix A(a) of the transformed graph G(a) (cf. (1)). The matrix of walk weights 
of this graph, R a = (rij(a)) nxn , provided that it exists, has the representation 

oo 

R a = Y,A{a) k = {I-A{a)Y l . (28) 

k=0 

When the series in (28) converges, we define the modified walk distance d™(i,j) by means of 

H a = 9 a a\nR a , (29) 

where 9 a is a positive scaling factor, 

D a = ±(h a l T +lhl)-H a , (30) 

6 Obviously, for (27), lim a ^ + w(a) = and liiria-^oo w(a) = w/p as well as for the transformation 
w(a) = tw = w j (p + a^ 1 ) which we used earlier (see (13)). However, the rate of convergence as a function 
of w for these two transformations cardinally differs. 
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where h a is the column vector of the diagonal entries of H a (cf. (4) and (5)), and 

C(i,j)=4-(a), (31) 

where (c%(a)) nxn = D a . 

d™(i,j) are graph-geodetic distances on V(G) since Lemmas 1 and 2 remain valid for R a . 

Definition 4. For a connected G, the e-walk distances on V(G) are the functions d™(i,j) = f 
d™(i,j) defined by (27)-(31). More generally, a modified walk distance is a distance that fits 
within the framework of (28)-(31) with some edge weight transformation w(a). 

The following expression for d™(i,j) is analogous to the representation (6) of the walk 
distances. It is easily obtained by combining (27)-(30). 

Lemma 9. For any a > 0, 

d?(i,j) = -9 a aiJ r f"l Y i,jeV,i^j, 

where 



m r andw v are the length and the weight of the walk r, respectively, d r = Xlces(r) ^ e > ^ e = w c 
and E(r) is the multiset of the edges of r (r may have repeated edges). 



For the e-walk distances, an analogue of Theorem 2 holds (and has a similar proof). 
Lemma 10. For a connected G, i,j G V(G), and any a > 0, in the notation of Theorem 2, 

j) = ~ H 1 - M^WM^ (I - A{a)^a{a)- U ). (32) 

Lemmas 9 and 10 are used in the proof of the following theorem describing the limiting 
properties of the e-walk distances (which differ from those of d™(i,j)). Suppose that for the 
e-walk distances d™(i,j), 9 a is such that 

lim 6 a = 1 and lim 6 a = 9 OQ eR+. (33) 

Theorem 5. For any vertices i,j<EV such that j ^ i, 

lim d™(i,j) = d™(i,j), 

a— 5>0+ 

where d ws (-, •) is the weighted shortest path distance (2) and 

^JTihj) = ^{p-'iiC^A^ + pj^ic^A^p, (34) 

where C = pi — A, p is the Perron vector of A, A — (dij) nxn results from A by replacing 
every nonzero entry by 1, and A\ is A with the ith row removed. 
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Proof. Using Lemma 9 and (33), for any vertices i and j ^ i we obtain 

T ■ ■ ( OL ) 

lim d e ™(i, j) = — lim a In — = = = — lim ahiriAa). (35) 

a^0+ ° a^0+ v /(l + o(l))(l + o(l)) ^0+ JK 

Observe that if r, r' G 'ft. 1 "' and, in the notation of Lemma 9, d r > < d r , then for all sufficiently 
small a > 0, p~ mi 'w r ,e~ di '^ a > p~ mi w v e~ d ^ a holds. Consequently, there exists «o > such 
that for all a G ]0, «o[ and some /%(«) satisfying 1 < f%(a;) < | 7^ JJ ' |, 

(a) = (a) p~ m " w-^^ (36) 

is true, where r G 7^*"' is a walk such that either 

(a) df < d r or 

(b) d f = d T and p~ mf w Y > p~ mi w T 

holds w.r.t. all r G 7^- By definition (2), in this case, d f = d ws (i,j). Using (35) and (36) we 
obtain 

lim d™(i,j) = - lim a( ln(« ij (a) p~ mf w f ) - d-Ja) = d ws (i,j). 
Now we prove (34). Using a different parameterization of the function (27): 

w(a) = w{pf) = , (37) 

where 7 = a -1 , observe that 

w'(0) = -p~\ (38) 

where ^'(7) is the derivative of w{j) with respect to 7. Denote by ^(7) the weighted adja- 
cency matrix of the graph modified through (37). As a — > 00 (7 — > + ), Eqs. (37) and (38) 
and the definition of A yield 

A(a) T3 = A(0)jj- a' l p~ l A^ + o(a~ 1 ) = p _1 (A 37 - aT 1 ^) + o(a~ 1 ), j G V. (39) 

For the vector a(a) S j (the jth column of A(a) with a(a)jj removed) this implies that 



a(a 



, p \a 3] - a l dj j + o(a j G V. (40) 



Substituting (39) and (40) in (32) and denoting by the ith column of A with a i 
removed result in 

rv 

lim d?(i,j) = lim - ln([J - p-\A- f3 - 7^) + 0(7)]^ P' 1 ^ - 7% + 0(7)) 

a^oo a— too 2 



x[I-p 1 {A rt - 1 A u ) + o{ 1 )] J 1 p \a- u - 1 a- u + o{ 1 ))) 



= -6^ lim ^ ln([£ JJ + -yA M + 0(7)^ 1 (a Jj - 7^ + 0(7)) 
x + 1 A ri + o(7)]J 1 (a a - 7%, + 0(7))) . 

Observe that when 7 — )■ + , 

((As + 7 A*) - T^wXA* + 7^n) _1 = I ~ iMCjt)- 1 + 0(7) 
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is true, from which 

(C ri + 1 A U )' 1 =Y(i)- 1 Y(t)A- lI Y(t) + o( 1 ), jeV (41) 

holds, where Y(i) = (A*) -1 (see (22)). 

Using (41), (23) and (24) and denoting by p T the Perron vector p of A with p i removed, 
we can now complete the proof: 

lim d?(i,j) = -9^ lim - ln([F(j) - " V(, ). l /; V(, - >ya) 

x [Y{i) - 7 n^«^)L' (%i - 7%) + 0(7)) 

= lim hnr^^-a-^TOM^p-^ + yOO,^^ 

«-*°°2 VpjPi V 7 1 /', JJ Pi 

+[Ym T ,] j p- 1 p T ^ + YiiYa-J-pi + oia- 1 ))" 
Pj pj / / 

which coincides with the desired expression. □ 

Is there any connection between the limiting e-walk distance (34) and the long walk 
distance d hW (i,j) defined by (20)? Let 

d^(t,j)= lim C(i,i), i,jeV(G) (42) 

(LeW is the abbreviation for "long e-walk"). In fact, d hew (i,j) is a distance, which is guar- 
anteed by Theorem 6. Prior to formulating this theorem, we provide a "topological" inter- 
pretation of d Lew (i,j). 

Recall the interpretation of the long walk distance d LW (i,j) given by Corollary 3 (Sec- 
tion 6) in terms of specific cycles in G(p~ l ). Such a cycle belonging to C 1 ^ is an i-to-i cycle 
that consists of two consecutive walks: the first one does not contain j and finishes at some 
vertex k which is marked; the second walk does not contain i, except for its end vertex. Let 
us take such a cycle and remove the edge connecting k with the subsequent vertex in the 
cycle. Let C 1 ^ be the set of resulting sequences and let c 1 ^ = w(C 1 ^) be its weight. Each 
element of C 1 ^ can be treated as a "cycle with a jump". Indeed, one can imagine a point 
moving along the cycle, reaching k, and then jumping to the next vertex instead of traversing 
the edge leading to it. 

Corollary 5 (of Theorem 5). For any vertices i,j<EV such that % 7^ j, 

2p 

Similarly to the long walk distance, the long e-walk function <i Lew (i, j) is large when the 
set comprising specific % — >■ % cycles avoiding, on the first stage, j along with specific j — > j 
cycles avoiding, on the first stage, i is "heavy" . 
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Proof. Using Theorem 5 we obtain (cf. the proof of Corollary 3): 

Q . ^ ^ 

d Lew {i,j) = ^( X] py ( fk^k q r qi{1) (p- 1 )+ pyflakqTgj^ip- 1 )) 

9 



2p \ 

r k,q£V,k^j k,q£V,k^i 



_ »(c*(j) + p(0) > (43) 
2p 

as required. □ 
Theorem 6. In the notation of (20), (33) and (42), and Theorem 5, if 

*-= 2 - (44) 
n p T Ap 

then for all vertices i,j G V, 

d™ r (i,j) = d™(i,j). 

Remark 1. Observe that — ^ — is the weighted average, with weights PiPj, of the nonzero 
entries a^j p of A/ p. Since, by assumption (33), lim a _^ + 9 a — 1, a scaling factor 9 a in (29) 
that ensures d Lew (i,j) = d LW (i,j) can be defined, for instance, as follows: 

9 00 a + (3 
a + p 

where 9^ is given by (44) and (3 is a positive parameter. 
Proof of Theorem 6. Let 

Vk q = 5kjpy\i r^p- 1 ) + 5 ki pyf k r qj{l) (p- 1 ), k,qEV, 
where is the kih element of Y(j)i = (jCjj)^ 1 and 



1, k^j, 
0, k = j. 



It follows from (26) and (43) that for any vertices % and j ^ i, 

d™(i,j) n9^ £ Vl j q a kq 

k,q&V 



(45) 



Using (18) one can represent the vector rfj = {rf* , . . . , r/nqY m the form 

ri% = p{njY-+m j -) = £E3 -(Y{j) i Pj + Y(i) i Pi), t,J,qeV, (46) 
\ Pi pj J p^j 

where Y(j) is Y(j) supplemented by row j and column j consisting of zero entries and Y(j) 1 
is the ith column of Y(j). The rest of the proof is based on the following lemma. 



15 



Lemma 11. Vector Y(j) z pj + Y(i) 3 pi with j ^ i is a positive multiple of the Perron vector p. 

Proof. Performing multiplication of block matrices and using (22)-(24) one can verify 
that CY(jYpj is the vector whose ith element is pj, the jth element is —pi, and the other 
elements are zero. Similarly, in the vector CY(iypi, the jth element is Pi, the ith element 
is — pj, and the remaining elements are zero. Therefore, £(Y(j) l pj + Y{i)ip^} = and so 
Y(jYpj + Y{iypi is a positive (as d LW (i,j) > 0) multiple of vector p spanning Ker£. □ 

By (46) and Lemma 11, every vector n l J is proportional to p. Owing to the factor p q 
in (46), every row of the matrix (r)^ q ) nxn indexed by k and q is proportional to p T . Therefore 

(vl J q )nxn = HijPP T , (47) 
where factor of proportionality. Substituting (47) in (45) leads to the result. □ 

8 Logarithmic forest distances as a subclass 
of walk distances 

For a graph G and a parametric family of functions (f a : R+ — > a G A C R, consider the 
matrices 

Q a = {I + L a )~\ (48) 

where L a = diag(A Q l)— A a and A a are the Laplacian and weighted adjacency matrices of the 
graph G a that differs from G by the edge weights only: 

W a = <p a (w) 

for any edge weight w in G and the corresponding weight w a in G a . 

The logarithmic forest distances on V{G) determined by the parametric edge weight 
transformation ip a are obtained [4] from the matrices Q a through the familiar conversions 

H a = 9h^Qt, (49) 
where 9 is a positive scaling factor generally depending on a and G and 

D a = ±(h a l' + 1/i'J - H a . (50) 

The simplest edge weight transformation (p a (w) = aw, a > determines [4] a specific 
family of logarithmic forest distances whose limiting cases are the shortest path distance and 
the resistance distance. 

In this section, we establish a connection between the walk distances and the logarithmic 
forest distances. 

Let us say that G is a balance-graph of G if G is obtained from G by attaching some 
loops and assigning the loop weights that provide G with uniform weighted vertex degrees. 
More formally, V(G) = V(G), E(G)CE(G), the edges in E{G) have the same weights in G, 
E(G) \E(G) is comprised of loops, and A(G) has constant row sums. Balancing G by loops 
will mean constructing any balance-graph of G. 
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Theorem 7. For any connected graph G, the family of logarithmic forest distances (48) -(50) 
with any edge weight transformation y? Q (u>) coincides with a certain family of modified walk 
distances (28) -(31) obtained through balancing the graphs G a by loops. 

Proof. For each a G A, choose any 

m a > max(£jj(o;) | % e V(G)), (51) 

where £a(a) are the diagonal entries of L a = L(G a ). Since G is connected, m a > 0. Set 

A(a) = (m a + l)- 1 (m a I-L a ). (52) 

Obviously, A(a) defined by (52) is the weighted adjacency matrix of the graph with loops 
G a obtained from G by transforming the edge weights in accordance with 

w(a) = (m a + l)"Va(w), 

attaching a loop to each vertex % such that m a > £u(a), and assigning the loop weights 
(m a + l)~ 1 (m Q — £u(a)); such weights provide A(a) with constant row sums m a /(m a + 1). 
Thereby, the G Q 's are obtained from the G a 's through balancing by loops. 
The Perron root of A(ct), m a /(m a + 1), is less than I. Consequently, 

oo 

ik^ = {I-A(a))- 1 (53) 

fc=0 

is a finite matrix of walk weights in G a . 
Substituting (52) into (53) yields 

R a = (m a + l)(I + L a )' 1 = (m a + l)Q a . (54) 

Passing Q a through the conversions (49)-(50) leads to the logarithmic forest distance 
with parameter a. Passing R a through the same conversions fits within the framework of 
(28)-(30) and so it generates a modified walk distance with parameter a. 

Finally, observe that the multiplier (m a + l) in (54) does not survive the conversions (49)- 
(50), so the two above distances coincide. Considering the whole domain A of a leads us to 
recognize that the initial family of logarithmic forest distances and the family of modified 
walk distances constructed by means of R a and (49)-(50) also coincide. □ 

The following corollary applies to the simplest case of logarithmic forest distances in 
which (p a (w) = aw and so L a = aL [4, Section 2], where L = (£ij) — L\ (a — 1) is the 
Laplacian matrix of G. 

Corollary 6. For any connected graph G, if (p a (w) = aw, then the family of logarithmic 
forest distances (48) -(50) with A = R+ and 9 given by (14) coincides with the family of walk 
distances (12) calculated for any balance-graph of G. 
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Proof. Following the lines of the proof of Theorem 7, denote m\ by m (see (51)). Since 
w a = aw, for every a > we have ma.x(£a(a) \ i<EV(G)) = amax(4(l) | i&V(G)) < am. 
So in (51) one can set 

m a = am, a > 0. (55) 
Define A(a) by (52) and (55) and let 

A a = a~ 1 (am + l)A(a). (56) 

Then 

A a = ml — L, p(A a ) = m, (57) 

where p(A a ) is the Perron root of A a . Thus, A a does not depend on a; denote it by A. 
Substituting (56) into (53) yields 

R a = (I -a(am+ l^A)' 1 = (/ - (p + a -1 )" 1 !)" 1 , 

where p = p(A). Setting, in accordance with (13), t — (p + a -1 ) -1 we have 

R a = (l-tA)-\ 

which coincides with (3) for the graph G whose weighted adjacency matrix is A. Passing R a 
through the conversions (49)-(50) with 9 given by (14) provides exactly the walk distance (12) 
with parameter a. Passing Q a through the same conversions results in the logarithmic forest 
distance under consideration, which, by Theorem 7, coincides with the above walk distance. 
Since this holds for every a G A, the two families of distances coincide. 
Finally, observe that by (57), 

A = ml - diag(Al) + A, (58) 

thus, G can be constructed by attaching a loop to each vertex i such that m > £ u and 
assigning the loop weights that provide G with uniform weighted vertex degrees. Obviously, 
each balance-graph of G can be obtained in this way. The corollary is proved. □ 



9 Connections between long walk distance 
and resistance distance 

9.1 Resistance distance as the long walk distance 
in a balance-graph 

It follows from Corollary 6 that the logarithmic forest distances in G with edge weight 
transformation (p a {w) = aw coincide with the walk distances in any balance-graph of G. 
Since by Proposition 3 in [4], the resistance distance is a limiting case of the logarithmic forest 
distances, the resistance distance can be obtained within the framework of walk distances. 
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Corollary 7 (of Theorem 7). For any connected G, the resistance distance in G coincides 
with the long walk distance d LW (i,j) defined by (20) in G, where G is any balance-graph of G. 

Corollary 7 is immediate from Proposition 3 in [4] and Corollary 6. It enables one to 
apply to the resistance distance any result obtained for the long walk distance. In particular, 
Corollary 3 of Section 6 (with p = m, where m is the uniform weighted vertex degree of a 
balance-graph of G) provides a kind of topological interpretation of the resistance distance, 
whereas Theorem 4 gives the following expression. 

Corollary 8. For any connected graph G on n vertices, let L be the Laplacian matrix of G 
and let d r (-, •) be the resistance distance on V(G). Then for any i,j G V(G) such that j ^ i, 

t r(i,j)=n- 1 ((L ss )T 1 + (L a )J 1 )l 

holds, where 1 is the vector of n — 1 ones and (L^)' 1 is the ith row of the inverse principal 
submatrix Ljj. 

Proof. By Corollary 7, the resistance distance in G coincides with the long walk distance in 
any balance-graph G of G. The weighted adjacency matrix A (see (58)) of the balance-graph 
G having all weighted vertexdegrees m is a nonnegative irreducible matrix with row sums m. 
Therefore p(A) = m and p(A) = n _1 l, where p(A) and p(A) are the Perron root and Perron 
vector of A, respectively. Substituting this and (58) into the expression for o? LW (i, j) given by 
Theorem 4 yields the desired equation. □ 

Note that Corollary 8 can also be proved using the results of [2]. 

It follows from the proof of Corollary 6 that the logarithmic forest distance with parameter 
a coincides with the walk distance (12), provided that a is defined by (13) and the graph 
has been balanced by loops. This justifies the reparameterization (13). 

Attaching the "balancing loops" leads to a model with a uniform connection resource 
possessed by all vertices: a lack of external connections is filled up by self-connections. As 
has been seen in this section, in such models, the logarithmic forest distances appear. These 
treat two peripheral adjacent vertices in a path as being closer to each other [4] than two 
central adjacent vertices are. It was mentioned in the Introduction that friendship is one of 
the relationships for which such a model can be considered. It may be appropriate when 
several people have a similar combined resource of friendship + self-absorption, but they are 
not equal in their ability to make friends. In contrast to this, the examples of Sections 6 
and 10 demonstrate that the walk distances are able to treat central adjacent vertices in a 
path as being closer to each other than the peripheral adjacent vertices are, which also may 
be relevant to certain applications. 

9.2 Long walk distance as the resistance distance 
in a modified graph 

The connection between long walk distance and resistance distance is two-way. Namely, the 
following relationship supplements Corollary 7. 
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Theorem 8. Let G be a connected graph on n vertices with weighted adjacency matrix A. 
Suppose that p is the Perron vector of A, p' = y/np/\\p\\2, and P' = diagp'. Then the long 
walk distance in G coincides with the resistance distance in the graph G' whose weighted 
adjacency matrix is P'AP'. 

We first prove two lemmas. 

Lemma 12. Let X = diagx, where x G W 1 is non-negative and x^O. Then in the notation 
of Theorem 8 and Lemma 11, 



d™(ij) = ^# Up-^uWi + (p-'mxU i, j^i 



holds, where P = diagp. 
Proof. For j ^ i, we have 



\ n \\2 r n IUII2 

\P\\2 



np T x 



[p-'Y( 3 )x) l + (p^mx),]i = ^{p^YU) l+ p-^m 3 )x 



np L x 

(PiPjr'iPjYuy+Pinyyx. 



np T x 



By Lemma 11, pjY(j) 1 + piY(iy = ftijp holds, where > is a factor of proportionality. 
Consequently, Theorem 4 yields 

Ml \p^nj)x)^{p--mx)h = ffter^fenA+^w^ 

J up X Pij p p 



np T x 



np T x \\p\\l 
= d™(i,j), 

and the lemma is proved. □ 

Remark 2. It can be shown that in Lemma 12, Y(j) can be replaced by Y(j) = (pi — A^)" 1 , 
where Ay is A with the entries in the jth row and jth column replaced by zero. 

Lemma 12 provides one more formula for the long walk distance, which can be computa- 
tionally cheaper than (25). 

Corollary 9. In the notation of Lemma 12 and Remark 2, 



d LW ^,j) = ^ [((Pi - A s )P)t 1 + ((pi - A^P)' 1 ] 1, j ? i. 

Proof. This follows from Lemma 12 by setting x — 1, which implies X — I. □ 

Lemma 13. Let A(G') = f3PA(G)P, where G is connected, P = diagp, p is the Perron 
vector of A(G), and /3 > 0. Then L(G') = (5PC(G)P, where C(G) = pi - A(G) and p is the 
Perron root of A(G). 
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Proof. Obviously, the non-diagonal entries of L(G r ) coincide with those of f3P£(G)P. 
Finally, /3PC(G)P has zero row sums: f3P£(G)Pl = /3P(pI-A(G))p = /3P(pp-pp) = 0. □ 

Proof of Theorem 8. Setting x = (p^ 1 , . . . ,p~ 1 ) T and using Corollary 8 and Lemmas 13 
and 12 we have 

- lbll ^((P£py: 1 + ((P£Pk): 1 ]i 



n 2 



np T x 

;LW/ 



[(p- 1 Y(j)X) i +(p- 1 Y(i)X)]l 



= d^(i,j), (59) 

as desired. □ 

Theorem 8 enables one to utilize all facts and expressions known for the resistance distance 
to calculate and study the long walk distance. In particular, Corollaries 10 and 11 follow. 

Corollary 10. The long walk distance is graph- geodetic; it is a squared Euclidean distance. 

Proof. Note that the resistance distance has these properties [21,32] and use Theorem 8. □ 

Corollary 11. In the notation of Theorem 8, 

d™(i,j) = (-ir + ^J^K J^h Vu,veV, (60) 



d™(ij) = C+47- 2 4~ ( 61 ) 

d™(t,j) = xl(t,j)(L' m r l x- v (t,j), j^i, Wu,veV. (62) 

where V = P'CP', L'~ = (-^~) is any g-inverse 7 ofL', and x(i,j) is the n-vector whose ith 
element is +1, jth element is —1, and the other elements are 0. 

Proof. This follows from Theorem 8, Lemma 13, and three classical expressions for the 
resistance distance (see [42, Eq. (17)], [38, Theorem 7-4], and [44, Eq. (14)] for (60), [45, 
Eq. (13)] and [35, Theorem 10.1.4] for (61), and [42, Eqs. (14)-(15)], [44, the first part of 
Eq. (16)], and [45, Eq. (15)] for (62); cf. [38, Chapter 7] and [1,29,46]). □ 

For any symmetric irreducible Laplacian matrix L, a simple choice of L~ is (L + J) -1 , 
where J = ^11 T [35, Section 10.1.3]. Another choice is the group inverse L* = (L + J) -1 — J 
(which for L is also the Moore- Penrose generalized inverse L + ). The latter formula, due to 
Sharpe and Styan [43] (see also [35, Theorem 10.1.2] and [6, Propositions 15, 16]), has been 
rediscovered several times. Alternatively, = ^ 2 -l T L= 1 l, i,j = l,...,n [43]. The general 
form of L~ is +al T + 16 T , where a and b are arbitrary n- vectors [44]. In particular, —\D 
is a g-inverse of L, where D is the matrix of resistance distances corresponding to L [44,54]. 

Finally, let us mention three simple expressions for d LW (i,j) in terms of £ obtained in [7]. 



r Z is a g-inverse [35] of X whenever X = XZX. 
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Theorem 9 ([7]). In the notation of Theorem 8, for all i,j 6 V such that j ^ i, 

det 



pf det C- t 



'3 

d™(i,j) = z-(i,j)C-z(i,j), 

d™(i,j) = zl(i l3 )(C m )- l z- v ( h 3), Vu,veV, 

where C = pi — A, C~ is any g-inverse of C, and z(i,j) is the n-vector whose ith element is 
l/p'i, jth element is —1/p'j, and the other elements are 0. 

Since for any balance-graph G of G, p'(G) — l : C(G) = L(G), while L(G) is an equicofac- 
tor matrix, Theorem 9 generalizes the three classical expressions for the resistance distance 
reproduced in Corollary 11. 

More generally, the long walk distance can be considered as the counterpart of the resis- 
tance distance obtained by replacing the Laplacian matrix L = diag(v41)— A and the vector 1 
which spans Ker L with the " para- Laplacian" matrix C = pi — A and the vector p' spanning 
Ker£. If G is balanced, i.e., A has constant row sums, then these distances coincide. 



10 Several metrics on the path of length 3 

The simplest graph on which the difference between the new and classical metrics can be 
illustrated is the path on 4 vertices (Fig. 3). 



Figure 3: The path P 4 . 

Some properties of different metrics on P 4 are summarized in Table 1. 

The walk distances and the logarithmic forest distances are graph-geodetic, so they satisfy 
^ 1 'rf(i~3) 2 ' 3 ' > = 1' smce an paths between 1 and 3 visit 2. Our examples suggest that these 
metrics are useful to model situations where, all other things being equal, the peripherality 
of vertices increases or decreases the distance between them. In such cases, the walk distances 
or the logarithmic forest distances can be used, respectively; in this example, for the former, 
d(l,2) > d(2,3), while for the latter, d(l,2) < d(2,3). 

The forest metrics [9,11] are obtained by the application of (5) to the matrices (J + aL)^ 1 , 
where L = diag(Al)— A is the Laplacian matrix of G. As well as walk metrics, they increase the 
distance between peripheral neighbors, however, the forest metrics are not graph-geodetic. 

The "plain" walk metrics [10] are obtained by the application of (5) to the matrices 
R t — (I — tAy 1 , where t — (p + a^ 1 )^ 1 (see (3) and (13)). Depending on a, they can either 
increase or decrease the distance between peripheral neighbors. Let us note that for P4, they 
set d(l,3) ~ d(l,4) or even d(l,3) > d(l,4) (see the last column of Table 1), which is quite 
exotic and does not meet the geodetic (graph traversal) approach taken in this paper. 
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Metric, a 


d(l,2) 
d(2,3) 


d(l,2)+d(2,3) 
d(l,3) 


d(l,4) 
d(l,3) 


Shortest path distance, 
Resistance distance 


1 


1 


1.5 


Walk distance, a — 1 


1.08 


1 


1.52 


Long walk distance (a — >■ oo) 


1+ 2 ^ ~ 1-62 


1 


1+ f J « 1.62 


Logarithmic forest distance, a = 2 


0.89 


1 


1.47 


Forest distance, a = 1 


1.08 


1.32 


1.26 


"Plain" walk distance, a = 4.5 


1.08 


1.28 


0.95 


"Plain" walk distance, a — 1 


0.96 


1.46 


1.03 



Table 1: The properties of several metrics on P 4 . 



Numerical examples and partial results suggest that the walk metrics and the logarithmic 
forest metrics more sensitively take into account the global structure of the graph than the 
electric metric 8 does. In particular, the distances they provide depend not only on the paths 
between two vertices, but also on their centrality. As a result, these metrics do not coincide 
with the shortest path metric when G is a tree. 
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